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Abstract 

Obligate endocellular symbiotic bacteria of insects and other organisms generally exhibit drastic genome reduction. Recently, 
it was shown that symbiotic gut bacteria of some stinkbugs also have remarkably reduced genomes. Here, we report the 
complete genome sequence of such a gut bacterium Ishikawaella capsulata of the plataspid stinkbug Megacopta 
punctatissima. Gene repertoire and evolutionary patterns, including AT richness and elevated evolutionary rate, of the 
745,590 bp genome were strikingly similar to those of obligate y-proteobacterial endocellular insect symbionts like Buchnera 
in aphids and Wigglesworthia in tsetse flies. Ishikawaella was suggested to supply essential amino acids for the plant-sucking 
stinkbug as Buchnera does for the host aphid. Although Buchnera is phylogenetically closer to Wigglesworthia than to 
Ishikawaella, in terms of gene repertoire Buchnera was similar to Ishikawaella rather than to Wigglesworthia, providing 
a possible case of genome-level convergence of gene content. Meanwhile, several notable differences were identified 
between the genomes of Ishikawaella and Buchnera, including retention of TCA cycle genes and lack of flagellum-related 
genes in Ishikawaella, which may reflect their adaptation to distinct symbiotic habitats. Unexpectedly, Ishikawaella retained 
fewer genes related to cell wall synthesis and lipid metabolism than many endocellular insect symbionts. The plasmid of 
Ishikawaella encoded genes for arginine metabolism and oxalate detoxification, suggesting the possibility of additional 
Ishikawaella roles similar to those of human gut bacteria. Our data highlight strikingly similar evolutionary patterns that are 
shared between the extracellular and endocellular insect symbiont genomes. 

Key words: Ishikawaella capsulata, Megacopta punctatissima, extracellular gut symbiosis, genome reduction, genome 
sequencing. 



Introduction 

Insects represent the majority of eukaryotic biodiversity in the 
terrestrial ecosystem (Grimardi and Engel 2005), and many 
insects harbor symbiotic bacteria in their gut, body cavity, 
or cells (Buchner 1 965; Bourtzis and Miller 2003). Many bac- 
teriocyte-associated endocellular symbiotic bacteria like 
Buchnera of aphids and Wigglesworthia of tsetse flies are es- 
sential for survival and reproduction of their host insects, 
showing stable maternal inheritance and host-symbiont 
cospeciation. These obligate endocellular bacteria generally 
exhibit peculiar genetic traits, such as AT-biased nucleotide 
composition, accelerated molecular evolution, and drastically 
reduced genome size less than 1 Mb (Shigenobu et al. 2000; 



Akman et al. 2002; Gil et al. 2003; van Ham et al. 2003; 
Degnan et al. 2005; Nakabachi et al. 2006; Perez-Brocal 
et al. 2006; Wu et al. 2006; McCutcheon and Moran 
2007, 2010; Lopez-Sanchez et al. 2009; McCutcheon 
et al. 2009; Sabree et al. 2009; Kirkness et al. 2010). These 
genetic traits are hypothesized to be the result of stable and 
nutrition-rich endocellular environment and also the conse- 
quence of attenuated purifying selection due to small popu- 
lation size and strong bottleneck, which are associated with 
the endosymbiotic lifestyle of the vertically transmitted sym- 
bionts (Wernegreen 2002; Moran et al. 2008; Moya et al. 
2008). Meanwhile, facultative insect endosymbionts like 
Wolbachia, Sodalis, Hamiltonella, Regiella, Serratia, and 
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Arsenophonus have larger genomes of 1-4 Mb and exhibit 
these genetic traits to much lesser extents (Wu et al. 
2004; Toh et al. 2006; Klasson et al. 2008, 2009; Degnan 
et al. 2009, 2010; Wilkes et al. 2010; Burke and Moran 
201 1), which may be relevant to less specialization and oc- 
casional horizontal transmission of the facultative microbial 
associates (Wernegreen 2002; Moran et al. 2008; Moya 
et al. 2008). 

In this context, extracellular symbiotic microbes in the 
alimentary tract of host insects may seem unlikely to exhibit 
such reductive genome evolution, considering that such 
associations are unstable outside the host body cavity, their 
lifestyle seems closer to that of free-living microbes, and thus, 
more genes are to be needed to cope with environmental 
fluctuations. However, recent studies have revealed that 
some stinkbugs harbor a specific y-proteobacterial symbiont 
in the midgut cavity, which is essential for host growth 
and reproduction, is vertically transmitted through host 
generations, exhibits host-symbiont cospeciation, and shows 
reduced genome sizes in the range of 0.7-0.9 Mb (Hosokawa 
et al. 2006, 201 0; Kikuchi et al. 2009). In particular, stinkbugs 
of the family Plataspidae are known for their unique mech- 
anism for vertical transmission called "symbiont capsule." 
When female insects lay eggs on their host plant, small 
brownish particles are deposited together, wherein the sym- 
biotic bacteria are encased. The bacteria in the capsules are 
ingested by newborn nymphs and colonize the midgut. In the 
developmental course, oddly, the nymphal midgut is con- 
stricted and separated into anterior and posterior parts. In 
adult insects, the anterior midgut is free of the symbiont, 
and the posterior midgut is transformed into a voluminous 
organ for harboring a huge amount of symbiont cells in 
the cavity. Judging from the peculiar anatomy, the plant 
sap ingested by the insect is completely absorbed in the 
anterior midgut, the waste is excreted through the Malpigh- 
ian tubules into the hindgut, and there is no food flow 
through the posterior midgut (Hosokawa et al. 2005, 
2006). Structural, functional, and evolutionary details of 
the symbiont genome are of interest but totally unknown. 

In this study, we determined the complete genome se- 
quence of such a gut bacterium "Candidatus Ishikawaella 
capsulata" (hereafter referred to Ishikawaella for simplicity) 
associated with the plataspid stinkbug Megacopta puncta- 
tissima, which unveiled strikingly similar evolutionary pat- 
terns shared between the extracellular and endocellular 
insect symbiont genomes. 

Materials and Methods 

Symbiont Genomic DNA Preparation 

We used an inbred strain of M. punctatissima, which was 
initially collected at Kobe, Japan, in 1999 from the kudzu 
vine (Pueraria lobata) and maintained in the laboratory on 
soybean plants {Glycine max) and pea pods {Pisum sativum). 



An adult female was dissected in a phosphate-buffered sa- 
line, and a symbiotic section of posterior midgut was iso- 
lated and subjected to DNA extraction. Aliquots of the 
DNA sample were subjected to quantitative polymerase 
chain reaction (PCR) of the symbiont groEL gene and the 
host elongation factor 1a gene as described (Hosokawa, 
Kikuchi, and Fukatsu 2007). 

Genome Sequencing, Gene Prediction, and Anno- 
tation 

The DNA sample was subjected to whole-genome shotgun 
sequencing as described (Akman et al. 2002; Toh et al . 2006). 
We constructed small-insert (2 kb) genomic libraries and gen- 
erated 1 2,247 sequences, giving 1 5-fold coverage from both 
ends of the genomic clones. Sequence assembly was carried 
out using the PHRED-PHRAP-CONSED package (Gordon 
et al. 2001). Remaining gaps were closed by sequencing 
of clones that spanned the gaps or by direct sequencing. 
To exclude the possibility of sequence error, we assessed 
and confirmed the quality of the assembled sequence. The 
correct assembly was confirmed by genomic restriction frag- 
ment patterns of pulsed field gel electrophoresis. Putative 
protein-coding sequences (CDSs) were predicted using Glim- 
mer2.0 (Delcher et al. 1999). The annotation of CDSs was 
based on results of BlastP searches against Escherichia coli 
genome and the NCBI nonredundant protein database. 
The CDSs exhibiting database matches to a functional gene 
of other bacteria, but interrupted by frameshifts and/or stop 
codons, were annotated as pseudogenes. To find pseudo- 
genes in spacer regions between CDSs, BlastX searches were 
also conducted against the database using the spacer as the 
query sequence. Transfer RNA (tRNA) genes were predicted 
by tRNAscan-SE (Lowe and Eddy 1997). Other noncoding 
RNAswere identified by similarity to E. coli homologs. Repet- 
itive sequence regions were identified using Tandem Repeats 
Finder 4.0 (Benson 1999). The theoretical isoelectric points 
of proteins were calculated using Compute pl/Mw tool 
(Bjellqvist et al. 1 993). The bacterial insertion sequences were 
predicted using ISfinder (Siguier et al. 2006). 

Molecular Phylogenetic and Evolutionary Analyses 

A set of 50 ribosomal protein genes, for which orthologs were 
commonly identified in Ishikawaella and other y-proteobacterial 
representatives, was selected for phylogenetic analyses 
(supplementary table S1, Supplementary Material online). Each 
of the ortholog sets was aligned using MAFFT 5.6 (Katoh et al. 
2005), and all the alignments were concatenated. The Whelan 
and Goldman (WAG)+r+lnv substitution model for the amino 
acid sequences was selected under the Akaike criterion using 
ProtTestv1.4(Abascaletal.2005). Molecularphylogeneticanal- 
yses were conducted by three methods, maximum likelihood, 
Bayesian, and neighbor-joining, using RAxML Version 7.0.0 
(Stamatakis 2006), MrBayes 3.1 .2 (Ronquist and Huelsenbeck 
2003), and PHYLIP 3.6 (Felsenstein 2005). Bootstrap values 
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for maximum likelihood and neighbor-joining phylogenies 
were obtained by 1,000 resamplings. Posterior probabilities 
were estimated for Bayesian phylogeny. A relative rate test on 
the basis of amino acid distances calculated from the concate- 
nated alignment of the 50 ribosomal protein sequences was 
performed using RRTree (Robinson-Rechavi and Huchon 
2000). Gene content clustering was conducted as described 
(Wolf et al . 2002). A set of clusters of orthologous groups of pro- 
teins (COGs) (Tatusov et al. 2003) represented in each genome 
wasdetermined.ThegenecontentsimilarityindexSwasdefined 
as follows: given that the set of COGs contained in the bacterial 
genomes A and B are shown as a and b, respectively, 5 between 
the genomes A and B was calculated as |anb|/|a Ub\. The gene 
content distance D was obtained as 1 - 5, and a pairwise dis- 
tance matrix for the analyzed bacterial genomes was subjected 
to neighbor-joining clustering. Bootstrap values were obtained 
by generating 1 ,000 replicates of the entire set of 4,873 COGs. 

Functional Analysis of Predicted Coding Genes 

The predicted protein-coding genes were classified on the 
basis of COG classification with some modification. Meta- 
bolic pathways were examined and verified according to 
the pathway descriptions in the EcoCyc and KEGG data- 
bases (Kanehisa et al. 2008; Keseler et al. 2009). 

Reconstruction of Ancestral Genomes 

To determine orthologous coding genes, we performed 
reciprocal BlastP searches between genes encoded in the 
E. coli K12 genome and genes encoded in each of nine 
insect symbiont genomes (1 Ishikawaella, 1 Baumannia, 
4 Buchnera, 2 Blochmannia, and 1 Wigglesworthia) with 
a cutoff E value at 10~ 5 and retained genes that exhibited 
significant hits in both comparisons. For duplicated genes, 
their orthologous relationships were also verified by molec- 
ular phylogenetic analyses. To search pseudogenes, TBIastN 
searches were conducted against the E. coli protein data set 
using spacer sequence between estimated open reading 
frames (ORFs) in the symbiont genomes as query. ORFs 
shorter than 40 amino acids with no significant homology 
to any bacterial genome sequences were discarded. Gene 
content was determined as distinct orthologous gene 
groups (OGGs) for each of the genomes of E. coli and 
the insect symbionts. By compiling all the OGGs from nine 
insect symbiont genomes without duplication, we obtained 
1 ,02 1 distinct OGGs in total. Then, we inferred the status of 
the OGGs on each of the ancestral nodes in the symbiont 
phylogeny by applying a parsimony principle to minimize to- 
tal number of gene loss events, where we ignored the pos- 
sibility of acquisition of new genes via lateral gene transfer in 
the streamlined insect symbiont genomes. Genome size of 
an ancestral node was estimated by adding the genome size 
of its descendant to the sum of the OGG lengths lost be- 
tween the ancestral node and the descendant. The length 
of the E. coli homolog of the OGG was assumed to be 




Fig. 1. — (A) An adult female of Megacopta punctatissima. (B) A 
posterior midgut dissected from an adult female. The symbiotic midgut 
section used for shotgun library construction is shown. (O A trans- 
mission electron micrograph of the symbiotic midgut section, wherein 
symbiont cells are densely packed. Abbreviations: hg, hindgut; mep, 
midgut epithelium; Mpt, Malpighian tubule; and sym, symbiont cell. 

the length of the lost gene. Because an ancestral node 
has two descendants, the estimated genome size of the an- 
cestral node was obtained by averaging the values for two 
descendants. 

Results and Discussion 

Preparation and Sequencing of the Ishikawaella 
Genome 

An adultfemale of M pmcfaf/ss/ma (fig. was dissected, and 
a symbiotic section of posterior midgut was carefully isolated 
(fig. ^B). The large midgut section contained a huge amount 
of Ishikawaella cells (fig. 10 from which 0.82 |ig of total 
DNA was prepared. Quantitative PCR assays evaluated the rel- 
ative abundance of 9,1 00 symbiont groEL gene copies per host 
elongation factor 1 oc gene copy in the DNA sample. Given the 
symbiontgenomesizeas0.8Mb(Hosokawa etal.2006)andthe 
host genome size as presumably 500 Mb or so, purity of the Ish- 
ikawaella genome in the DNA sample was estimated to be 
[9,100 x 0.8]/[9,100 x 0.8 + 1 x 500] x 100 = 87%. The 
DNA sample was subjected to shotgun library construction 
and Sangersequencing. Of 1 2,247 sequence reads determined, 
10,665 and 361 were assembled into a circular bacterial chro- 
mosome and a circular plasmid, respectively. Hence, 90% 
(11,026/12,247) of the reads represented the Ishikawaella 
genome, which agreed with the quantitative PCR estimate. 

General Features of the Ishikawaella Genome 

The main genome of Ishikawaella consisted of a circular 
745,590 bp chromosome encoding 611 putative protein- 
coding ORFs with an average size of 987 bp, which covers 
81 % of the whole genome. Of these, 568 were assigned to 
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^Translation 

| | RNA processing and modification 
I iTranscription 

^| Replication, recombination and repair 
|Cell cycle, division, chromosome partition 
^] Defense mechanisms 
^^Signal transduction 

| Cell wall/membrane/Cell envelope 

□ Intracellular trafficking, 
secretion, and vesicular transport 

□ Posttranslational modification, 
protein turnover, chaperones 



U Energy production and conversion 

Carbohydrate metabolism 
Q Amino acid metabolism 
| | Nucleotide metabolism 
I I Coenzyme metabolism 
| | Lipid metabolism 
U Inorganic ion metabolism 
| | Secondary metabolites metabolism 
| | General function prediction only 
I I Function unknown 
□ No homologous ORFs 



Fig. 2. — A circular view of the Ishikawaella genome. On the GC skew 
circle, red and blue indicate GC-rich and poor, respectively. On the CDS 
circle, colors indicate functional categories as shown at the bottom. 

putative biological functions, 34 matched hypothetical pro- 
teins of unknown function, and 9 were unique to Ishikawaella. 
Some of the putative lshikawaella-spec\f\c ORF might not be 
true genes, considering that eight of nine were less than 
190 bp in size. Three ribosomal operons, three structural 
RNA genes, and 37 tRNA genes assigning all 20 amino acids 
were identified (supplementary table S1, Supplementary 
Material online; fig. 2). Besides these genes, 35 ORFs and 
one 1 6S rRNA gene were truncated and/or interrupted by stop 
codons, which are probably pseudogenes (supplementary 
table S2, Supplementary Material online). In addition, Ishika- 
waella hasa 9, 1 39 bp circular plasmid, named pAst, containing 
eight ORFs (supplementary fig. S1, Supplementary Material 
online). Phylogenetic analyses based on 50 ribosomal proteins 
revealed that Ishikawaella is placed in a well-supported clade in 
the y-Proteobacteria together with obligate endocellular sym- 
biotic bacteria of other insects, such as Buchnera of 
aphids, Baumannia of sharpshooters, B loch man nia of ants, 
and Wigglesworthia of tsetse flies (fig. 3). The phylogenetic 
relationship suggested that the extracellular symbiont 



Ishikawaella shares a common ancestry with the obligate en- 
docellular symbionts and that their common ancestor might 
have already experienced reductive genome evolution to some 
extent. It should be noted that comparison of these symbiont 
genomes provides an ideal opportunity to investigate how the 
extracellular condition in the gut cavity and the endocellular 
conditions in the bacteriocytes have affected their genome 
evolution. 

Similarity of the Ishikawaella Genome to Endocel- 
lular Insect Symbiont Genomes 

General genomic features of Ishikawaella were strikingly 
similar to those of endocellular symbiotic bacteria, com- 
monly exhibiting small genome sizes, high AT contents, high 
inferred pi values for encoded proteins, and few mobile ge- 
netic elements (table 1 ). The evolutionary rate of Ishikawael- 
la was significantly higher than those of free-living bacteria 
and was equivalent to those of the endocellular symbiotic 
bacteria: nearly equal to those of Buchnera and Baumannia 
and slightly lower than those of Blochmannia and Wiggles- 
worthia (supplementary table S3, Supplementary Material 
online). The genome of Ishikawaella was about six times 
smaller than the genome of E. coli, and almost all the Ishi- 
kawaella ORFs (97.0%; 589/607) had their orthologs in the 
genome of £ coli, indicating that the /s/?//caivae//a genome is 
a small subset of the genome of free-living y-proteobacteria, 
like the genomes of Buchnera and obligate endocellular 
symbionts of other insects (supplementary fig. S2A-D, 
Supplementary Material online). 

Metabolic Capacity and Putative Biological Role of 
Ishikawaella 

Despite the drastic genome reduction, the Ishikawaella ge- 
nome retained many genes responsible for basic cellular pro- 
cesses such as translation, replication, energy production, 
etc., as many endocellular symbiont genomes (supplemen- 
tary table S4, Supplementary Material online). Many genes 
involved in metabolism of amino acids were conserved in the 
Ishikawaella genome (figs. 4 and 5; supplementary table S4, 
Supplementary Material online). Gene content analysis in- 
ferred that Ishikawaella is capable of synthesizing almost 
all essential amino acids and some nonessential amino acids 
and some vitamins and cofactors (figs. 4 and 6). Only the 
final step enzyme, ilvE, involved in the synthesis of branched 
essential amino acids, namely isoleucine, leucine, and valine, 
was missing in the Ishikawaella genome (fig. 6A). It should 
be noted that the gene is also lacking in the Buchnera ge- 
nome (Shigenobu et al. 2000). Presumably, absence of the 
gene is complemented by corresponding enzymes either 
from Ishikawaella itself or from the host insect (supplemen- 
tary table S5, Supplementary Material online). Given that 
plataspid stinkbugs feed exclusively on plant sap devoid 
of essential amino acids and some vitamins, Ishikawaella 
probably compensates for the nutritional deficiency of 
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Fig. 3. — Phylogenetic placement of Ishikawaella in the y-Proteobacteria. A maximum likelihood phylogeny inferred from concatenated 50 
ribosomal protein sequences (6,435 aligned amino acid sites) is shown. Statistical supports (>70%) for each clade are shown at each node in the order 
of maximum likelihood, Bayesian, and neighbor-joining analyses. Asterisks denote statistical support values lower than 70%. 



the diet, as Buchnera does for the host aphid (Douglas 1 998; 
Baumann 2005). 

Similarity in Gene Repertoire between Ishikawaella 
and Endocellular Insect Symbionts 

The extracellular symbiotic life in the gut cavity seems to 
entail molecular, cellular, and physiological requirements 
different from the endocellular life in the bacteriocyte. 
For example, the environment in the gut cavity may be less 
homeostatic than that in the cytoplasm, which would lead 
to retention of genes needed for free-living life, such as 
transcription factors for metabolic regulation. The extracel- 
lular symbiont may be more frequently exposed to me- 
chanical and osmotic stresses, which would result in 
conservation of genes related to cell wall synthesis and for- 
mation. The extracellular symbiont in the gut cavity is less 
likely to be affected by host innate immunity, which might 
affect its cell surface molecules. Hence, we initially ex- 
pected that the gene repertoire of Ishikawaella might be 
divergent from those of obligate endocellular symbionts 
of other insects. However, simple comparison of their gene 
contents indicated no such explicit tendencies (supplemen- 
tary fig. S2E-V, Supplementary Material online). Cluster 
analysis of their gene contents revealed that 1) the gene 
repertoire of Ishikawaella was the most similar to those 
of obligate endocellular insect symbionts Buchnera, Bau- 
mannia, Blochmannia, and Wigglesworthia, 2) among 



them, the Ishikawaella genome was the closest to the 
Buchnera genome and the most distant from the Wiggles- 
worthia genome, 3) obligate endocellular symbiotic bacte- 
ria of deep-sea mollusks and termite-associated 
protozoans were placed just outside the cluster, and thus 
4) on account of the gene repertoire, Ishikawaella was, de- 
spite its extracellular niche, nested within microbial clusters 
exclusively consisting of endocellular symbiotic bacteria 
(fig. 7). These patterns may provide some insights into evo- 
lutionary aspects of the reduced symbiont genomes. Firstly, 
regardless of their endocellular or extracellular habitats, 
the reductive genome evolution was commonly observed. 
Secondly, regardless of their phylogenetic affinities, diverse 
symbiont lineages have experienced such reductive ge- 
nome evolution. Hence, it appears likely that the reductive 
genome evolution is attributable neither to their cellular 
habitats nor to their phylogenetic placements but rather 
to some general traits commonly associated with diverse 
obligate bacterial symbionts, namely the small population 
size, the lack of recombination, and the deletional bias in- 
herent in all bacterial genomes (Wernegreen 2002; Moran 
et al. 2008; Moya et al. 2008). We also suggest that the 
host insect ecology and the physiological involvement of 
the symbiont might have also affected the evolutionary 
patterns. Plataspid stinkbugs and aphids are phloem sap 
feeders, wherein the symbionts supply essential amino 
acids for the hosts (Douglas 1 998; Baumann 2005). Tsetse 
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Table 1 

General Features of the Genomes of Ishikawaella, Endocellular Insect Symbionts, and Free-Living y-proteobacteria 



Bacterium 



Ishikawaella 
capsulata 



Buchnera 
aphidicola 
APS 



Baumannia 
cicadellinicola 



Blochmannia 
floridanus 



Wigglesworthia 
glossinidia 



Escherichia 
coli K12 



Vibrio 
cholerae 01 



Host insect 
Symbiotic niche 
Biological function 



Stinkbug Aphid Sharpshooter 
(Plataspidae) (Aphididae) (Cicadellidae) 
Extracellular Endocellular Endocellular 
(midgut cavity) (bacteriocyte) (bacteriocyte) 
Essential amino Essential aminoVitamins and 



Carpenter ant 
(Formicidae) 
Endocellular 
(bacteriocyte) 
Essential amino 



Tsetse flies 
(Glossinidae) 
Endocellular 
(bacteriocyte) 
B vitamins and 



Not applicable Not applicable 

Free-living Free-living 
(vertebrate gut)(vertebrate gut) 
Not applicable Not applicable 



acids (inferred) a acids; riboflavincofactors (inferred) d acids (confirmed) e cofactors (confirmed) f 
(confirmed) b ' c 



Chromosome (bp) 


745,590 


640,681 


686,192 


705,557 


697,724 


4,639,675 


4,033,464 


Plasmid 


1 


2 


0 


0 


1 


0 


0 


A+T content (%) 


70 


74 


67 


73 


77 


49 


53 


Coding content (%) 


83 


88 


89 


84 


89 


88 


88 


Predicted proteins 


611 (8) 


564 (10) 


596 


583 


611 (6) 


4,243 


3,835 


Ribosomal RNAs 


9 


3 


6 


3 


6 


22 


25 


tRNAs 


37 


32 


39 


37 


34 


89 


98 


Small RNA genes 


3 


4 


2 


3 


2 


74 


34 


Average pi of proteins 


8.4 


9.4 


8.6 


8.9 


9.8 


6.9 


6. 


Pseudogenes 


36 


13 


9 


6 


14 


99 


159 


Insertion sequence (IS) elements 


0 


0 


0 


0 


0 


42 


16 



This study. 
b Douglas (1998). 
c Nakabachi and Ishikawa (1999). 
d Wu et al. (2006). 
e Feldhaar et al. (2007). 
f Nogge(1981). 

flies, on the other hand, live on vertebrate blood, where the 
symbiont mainly provides B vitamins to the host (Nogge 
1 981 ; Akman et al. 2002). The common nutritional physiol- 
ogy of the host insects may be relevantto the higher similarity 
of gene content between Ishikawaella and Buchnera than 
that between Ishikawaella and Wigglesworthia (fig. 7), 
although Buchnera is phylogenetically closer to Wiggleswor- 
f/7/athanto/s/7//caivae//a(fig.3).Wesuggestthatthismaybean 
example of convergent genomic evolution in response to the 
common ecological necessity (e.g., Xu et al. 2007; Lopez- 
Sanchez et al. 2009; McCutcheon and Moran 201 0). 

Differences in Gene Repertoire between 
Ishikawaella and Endocellular Insect Symbionts 

Although the gene reduction patterns in Ishikawaella were 
strikingly similar to those in the endocellular symbionts, de- 
tailed pairwise comparisons revealed several noteworthy 
differences that may be relevant to functional and ecological 
aspects of these insect symbionts. For example, Ishikawaella 
possessed more genes for the synthesis of amino acids and 
cofactors than Buchnera and Blochmannia (supplementary 
table S6A and C, Supplementary Material online), which 
may indicate the broader nutritional capability of the extra- 
cellular symbiont than the endocellular symbionts and/or 
the younger coevolutionary history of the former than 
the latter. Meanwhile, Ishikawaella possessed more amino 
acid genes and less cofactor genes than Baumannia 



and Wigglesworthia (supplementary table S6C and D, 
Supplementary Material online), which clearly reflects their 
distinct biological roles; Ishikawaella as supplier of essential 
amino acids, whereas Baumannia and Wigglesworthia as 
suppliers of cofactors (Nogge 1981; Akman et al. 2002; 
Wu et al. 2006). Unexpectedly, the extracellular symbiont 
Ishikawaella retained fewer genes related to cell wall synthe- 
sis and lipid metabolism than the endocellular symbionts 
Blochmannia, Baumannia, and Wigglesworthia (supple- 
mentary table S6B-D, Supplementary Material online; fig. 
4). How Ishikawaella stands the extracellular condition with 
a small number of cell wall and membrane genes is an 
enigma. A previous histological work revealed that a symbi- 
ont capsule of M. punctatissima consists of three structural 
components: symbiont cells, secretion matrix, and chitinous 
envelope. The symbiont cells are embedded in the secretion 
matrix, and the envelope layer covers the surface of the cap- 
sule (Hosokawa et al. 2005). It seems that, although spec- 
ulative, the secretion matrix may play some roles in symbiont 
preservation outside the host body. Ishikawaella exhibited 
more genes related to energy production and conversion 
than the endocellular symbionts (supplementary table 
S6/A-D, Supplementary Material online), which was attribut- 
able to, at least partly, the complete TCA cycle genes in Ishi- 
kawaella in contrast to lack of those genes in the 
endocellular symbionts (supplementary fig. S3, Supplemen- 
tary Material online; figs. 4 and 5). Plausibly, availability of 
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metabolic intermediates in the host cytoplasm might have 
resulted in the evolutionary consequence in the endocellular 
symbionts, whereas the extracellular symbiont has to retain 
its own metabolic genes. Ishikawaella possessed no flagel- 
lar-related cell motility genes, whereas Buchnera and Wig- 
glesworthia retained many of them (supplementary table 
S6A and D, Supplementary Material online; fig. 4). In Buch- 
nera, numerous flagellar basal bodies were found on the cell 
membrane, which are suggested to mediate material trans- 
port from and to the host cytoplasm (Maezawa et al. 2006). 
Because reproduction of tsetse flies entails adenotrophic vi- 
viparity, Wigglesworthia might require flagellar motility for 
vertical transmission via milk gland secretion (Akman et al. 
2002; Attardo et al. 2007). Harbored extracellularly in the 
isolated gut cavity without food flow (Hosokawa et al. 
2005, 2006), Ishikawaella may require neither such trans- 
porters nor flagellar motors. 

Origin of Ishikawaella: Specialized Gut Bacterium 
or Ex-endocellular Symbiont? 

Whether the ancestor of Ishikawaella was an extracellular 
gut bacterium or an endocellular bacterium is an unan- 
swered question. Its extracellular habitat in the midgut cav- 
ity favors the hypothesis that Ishikawaella is a highly 
specialized gut bacterium. On the other hand, molecular 
phylogenetic analyses based on 50 ribosomal protein se- 
quences revealed that Ishikawaella is nested in a clade of 
endocellular insect symbionts of mutualistic nature with 
drastically reduced genomes, such as Buchnera, Blochman- 
nia, Baumannia, and Wigglesworthia (fig. 3), which raises 
an alternative hypothesis that their common ancestor was 
endocellular and Ishikawaella established the extracellular 
habitat secondarily. However, we note that the phylogenetic 
pattern requires careful interpretation: We cannot rule out 
the possibility that the apparent grouping of Ishikawaella 
with the insect endocellular symbionts was caused by their 
fast-evolving AT-rich gene sequences via the artifactual ef- 
fect so-called long-branch attraction (Herbeck et al. 2005). 
To address this question, more genome data and sophisti- 
cated analyses of allied y-proteobacterial endocellular and 
extracellular insect symbionts are needed. 

Estimation of Genome Contents and Gene Losses 
in the Evolutionary Course of Ishikawaella and 
Allied Endocellular Insect Symbionts 

Regardless of its endocellular or extracellular symbiotic sta- 
tus, the common ancestor of the Baumannia-lshikawaella- 
Buchnera-Blochmannia-Wigglesworthia clade must have 



Fig. 4. — Comparison of the metabolic gene repertoire between 
Ishikawaella, insect endocellular symbionts, and free-living y-proteobac- 
teria. The minimal number of genes for a metabolic pathway is shown in 
each of the brackets. Color indicates the ratio of retained genes to the 
minimal gene set for a metabolic pathway: green for 100%, orange for 



99-75%, yellow for 74-50%, pink for 49-25%, and gray for 24-0%. 
Asterisk denotes that the bacterium possesses an alternative pathway 
for biosynthesis of the final product. Number in the parentheses shows 
the minimal number of genes for the alternative pathway. 
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already experienced remarkable genome reduction. In an at- 
tempt to gain insights into the evolutionary process of the 
genome reduction, on the basis of whole genome sequen- 
ces of Ishikawaella and allied eight endocellular symbionts, 
we inferred a phylogenetic reconstruction of gene reper- 
toires for common ancestors of the insect symbionts. All 
protein-coding genes in the genomes were organized into 
1,021 OGGs based on reciprocal Blast searches, of which 
966 possessed homologs in E. coll genome, 48 repre- 
sented homologs in genomes of enteric bacteria other 
than E. coll, and 7 were unique to genomes of the insect 
symbionts and their close relatives (supplementary table 
S7, Supplementary Material online). On the basis of 
the phylogenetic relationship (fig. 3), estimated number 
of OGGs was mapped on each ancestral node, and 



estimated number of OGG losses was allocated to each 
ancestral branch (fig. 8; supplementary table S8, Supple- 
mentary Material online). The common ancestor of the 
insect symbionts was estimated to possess 1,021 OGGs, 
which is equivalent to genome size of about 1 .2 Mb. As 
for Ishikawaella, at least 343 OGGs were lost and 33 
OGGs were pseudogenized after divergence from the 
other insect symbionts, resulting in a 0.75 Mb genome 
with 608 OGGs. In the lineage leading to Ishikawaella, 
genes of the following categories were preferentially lost: 
Cell wall/membrane/envelope-related genes involved in 
synthesis of lipopolysaccharide, peptidoglycan, and outer 
membrane; cell mortility— related genes responsible for 
flagellar formation; and lipid and ion transport-related 
genes (fig. 8; supplementary table S8, Supplementary 
Material online). In each of the lineages leading to the 
other insect symbionts, preferential loss of specific gene 
sets was identified as follows: amino acid-related genes, 
cell wall/membrane/envelope-related genes, and cell mo- 
tility— related genes in the Baumannia lineage; amino 
acid-related genes in the lineage leading to the Wiggles- 
worthia lineage; cell motility— related genes in the Bloch- 
mannia spp. lineage; coenzyme-related genes, lipid and 
ion transport-related genes, and cell wall/membrane/en- 
velope-related genes in the Buchnera spp. lineage; and 
coenzyme-related genes, nucleotide-related genes, and 
cell wall/membrane/envelope-related genes in the Buch- 
nera str. Cc lineage where further genome reduction oc- 
curred (Perez-Brocal et al. 2006) (fig. 8; supplementary 
table S8, Supplementary Material online). Lastly, we note 
that the above arguments are based on the assumption 
that the symbiont phylogeny is correct, whereas the fast- 
evolving and AT-rich symbiont genomes are potentially 
prone to long-branch attraction and other artifacts in 
phylogenetic inferences (Herbeck et al. 2005). 

Plasmid-Encoded Genes of Ishikawaella 

The plasmid of Ishikawaella, pAst, carries four genes, astC, 
astA, astB, and astD (supplementary fig. S1, Supplemen- 
tary Material online), which encode enzymes of the am- 
monia-producing succinyltransferase (AST) pathway that 
yield glutamate and ammonia from arginine (Schneider 
et al. 1998). In many bacteria, the ast operon generally 
consists of five genes, astC, astA, astD, astB, and astE. 
In the case of Ishikawaella, the fifth gene, astE, is sepa- 
rately encoded in the chromosome (supplementary table 
S1 , Supplementary Material online). Upon nitrogen starva- 
tion, the AST pathway plays a principal role in E. coli for 
utilizing arginine as nitrogen source (Schneider et al. 
1998). The AST pathway may play a similar role in Ishika- 
waella, although it should be verified whether the food of 
the plataspid stinkbug, phloem sap of leguminous plants, 
contains a sufficient quantity of arginine for that purpose. 
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It is also notable that Ishikawaella may utilize astC for 
transamination in the synthetic pathways of arginine 
and lysine (fig. 6). In E. coll, not only astC but also argD 
catalyze these reactions (Riley and Glansdorff 1983), but 
in Ishikawaella, argD is encoded neither in the plasmid nor 
in the chromosome. 

The plasmid pAst also contains ode, a gene encoding 
oxalate decarboxylase (supplementary fig. S1, Supple- 
mentary Material online), which is widely distributed 
among fungi and bacteria (Makela et al. 2009). Oxalate 
is commonly present in higher plants and often accumu- 
lated at substantial concentrations in the plant biomass, 
the calcium salt of which can function as antiherbivore 
defensive agent (Franceschi and Nakata 2005). In the case 
of human, food-derived oxalate is degraded by a gut bac- 
terium Oxalobacter formigenes (Allison et al. 1985), and 
missing of this gut bacterium increases the risk of hyper- 
oxaluria and relevant disorders, such as the development 
of calcium oxalate kidney stones (Sidhu et al. 1999). The 
possibility that the plasmid of Ishikawaella may play a sim- 
ilar detoxifying role should be taken into account in future 
studies. 



Conclusion and Perspective 

Taking all these results together, we conclude that drastic 
genome reduction can occur even in gut symbiotic associ- 
ations. In this study, we demonstrated that general genomic 
features of the extracellular stinkbug symbiont Ishikawaella 
are very similar to those of allied endocellular insect sym- 
bionts, such as Buchnera, Baumannia, Blochmannia, and 
Wigglesworthia. Such genomic features include general 
gene repertoire, genome size, AT richness, elevated evolu- 
tionary rate, paucity of mobile genetic elements, etc. On 
the other hand, there are some differences between the 
gene repertoire of Ishikawaella and those of the endocellu- 
lar symbionts, which may be relevant to functional and 
physiological adaptations of the respective symbionts and 
can be attributed to preferential gene losses in each of 
the symbiont lineages. The strikingly similar evolutionary 
patterns shared between the extracellular and endocellular 
insect symbionts despite their apparently distinct ecological 
niches suggest some common evolutionary processes un- 
derpinning the symbiotic associations, which may be rele- 
vant to stable and nutrition-rich environment in the host 
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bacterial phyla; in parentheses are host organisms for endosymbionts. Colored bacterial names indicate red, obligate endocellular insect symbionts; 
blue, facultative endocellular insect symbionts/parasites; and green, obligate endocellular symbionts of non-arthropod organisms. Colored genome 
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body and also attenuated purifying selection due to small 
population size and strong bottleneck associated with the 
lifestyle of the vertically transmitted symbionts (Wernegreen 
2002; Moran et al. 2008; Moya et al. 2008). 

We point out that the secretion matrix embedding Ishi- 
kawaella cells within symbiont capsules of the plataspid 
stinkbug (Hosokawa et al. 2005) may be an important fac- 
tor that has contributed to the reductive evolution of the 
symbiont genome in the gut cavity. It seems likely, although 



speculative, that the secretion matrix is somehow mimick- 
ing the intracellular environment in the cytoplasm, protect- 
ing Ishikawaella against dehydration, irradiation, and 
shortage of metabolites that the symbiont cannot make 
on its own. In this context, biochemical composition and 
biological function of the secretion matrix are to be inves- 
tigated in depth. 

Are there any substantial differences between the evolu- 
tionary consequences of the extracellular and endocellular 
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symbiotic associations? Years ago, most sequenced genomes 
of obligate insect symbionts ranged from 0.6 to 1 Mb in size 
and contained more than 500 genes (see table 1). These val- 
ues are similar to the smallest known pathogen genomes (see 
fig. 3), and the Ishikawaella genome falls into that range. 
However, recent studies revealed that some endocellular in- 
sect symbionts exhibit genome sizes smaller than 0.5 Mb 
down to 0.14 Mb, which are almost approaching to the ex- 
tremely streamlined genomes of organelles (Nakabachi et al. 
2006; Perez-Brocal et al. 2006; McCutcheon and Moran 
2007, 2010; McCutcheon et al. 2009; McCutcheon 2010). 
It is difficult to imagine that such suborganellar microbes sur- 
vive extracellularly in the gut cavity. How small such extracel- 
lular gut symbiont genomes can be is of evolutionary interest, 
deserving future surveys of obligate gut symbiotic bacteria of 
diverse stinkbugs and other organisms. 

Finally, we point out an applied perspective of the Ishi- 
kawaella genome. A previous study demonstrated that 
the pest status of plataspid stinkbugs is determined by 
the Ishikawaella genotype rather than by the insect geno- 
type (Hosokawa, Kikuchi, Shimada, et al. 2007). Compar- 
ative analyses of the Ishikawaella genomes from the 
pest and nonpest stinkbug species would shed light on 
the mechanisms underlying the symbiont-mediated pest 
evolution. 

Supplementary Material 

Supplementary tables S1-S8 and figures S1-S3 are available 
at Genome Biology and Evolution online (http://www.gbe. 
oxfordjournals.org/). 
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